formatting issue

Mundane & Pointless Stuff I Must Share: The Off Topic Forum

Moderator: Moderators

Post Reply
ckafrica
Duke
Posts: 1139
Joined: Fri Mar 07, 2008 7:54 pm
Location: HCMC, Vietnam

formatting issue

Post by ckafrica »

Hey, I'm looking to compile info off several different pdfs into a single text file and I'm wondering if there is a quick way to de-columnize the text I cut and paste. I'm going to be mixing different original font sizes and column sizes so I want to be able to wipe out most of the original formatting so it fits the normal straight line margin to margin.

I've got MS office and Sun open office.

thanks in advance for the help (unless no one helps in which case pox on you all!!!!)
The internet gave a voice to the world thus gave definitive proof that the world is mostly full of idiots.
User avatar
Crissa
King
Posts: 6720
Joined: Fri Mar 07, 2008 7:54 pm
Location: Santa Cruz

Post by Crissa »

Past -> special -> without formatting

-Crissa
User avatar
Josh_Kablack
King
Posts: 5317
Joined: Fri Mar 07, 2008 7:54 pm
Location: Online. duh

Post by Josh_Kablack »

save as plaintext, or just paste into notepad (or other plaintext editor)
"But transportation issues are social-justice issues. The toll of bad transit policies and worse infrastructure—trains and buses that don’t run well and badly serve low-income neighborhoods, vehicular traffic that pollutes the environment and endangers the lives of cyclists and pedestrians—is borne disproportionately by black and brown communities."
ckafrica
Duke
Posts: 1139
Joined: Fri Mar 07, 2008 7:54 pm
Location: HCMC, Vietnam

Post by ckafrica »

Thanks for the tips, but neither of them will eliminate the present breaks of the columns. I'm trying to avoid having to remove them manually from a hundred pages of combined texts from three books.

I'm trying to compile all the character generation info for Gurps 4ed into a single text rather than shuffling through 5 books to find a slight variation of on the same ability or skill.

I have managed to finagle the fonts and column size on the page to look about right though.

As aa second query. If I were to print a PDF but wanted to get rid of the margin art, is there an easy way to do that? the gurps margin art is just a solid dark colored band which is both ugly and ink wasting
Last edited by ckafrica on Thu Dec 04, 2008 4:40 am, edited 1 time in total.
The internet gave a voice to the world thus gave definitive proof that the world is mostly full of idiots.
User avatar
Crissa
King
Posts: 6720
Joined: Fri Mar 07, 2008 7:54 pm
Location: Santa Cruz

Post by Crissa »

You might ask on the forums at steve jackson if they have a Lite 4e compiled version PDF yet.

If you're getting hard line breaks, there is no real way to undo it, except for a really advanced bot, for which they don't just hand out on the street, ya know?

-Crissa
User avatar
Josh_Kablack
King
Posts: 5317
Joined: Fri Mar 07, 2008 7:54 pm
Location: Online. duh

Post by Josh_Kablack »

Here's a help page for Open Office
Removing Line Breaks
Use the AutoFormat feature to remove line breaks that occur within sentences. Unwanted line breaks can occur when you copy text from another source and paste it into a text document.

This AutoFormat feature only works on text that is formatted with the "Default" paragraph style.

1.Choose Tools - AutoCorrect.
2.On the Options tab, ensure that Combine single line paragraphs if length greater than 50% is selected. To change the minimum percentage for the line length, double-click the option in the list, and then enter a new percentage.
3.Click OK.
4.Select the text containing the line breaks that you want to remove.
5.In the Apply Style box on the Formatting bar, choose Default.
6.Choose Format - AutoFormat - Apply.
Oddly you can use the search and replace with the \n regular expression to replace characters with hard line breaks, but it doesn't seem to work to search for hard line breaks to replace them with other characters.


And as further options, you might consider:

Linux FMT command (if you have easy access to the right distro), or learning enough PERL to write a script to remove hard line breaks that occur before you've counted a block of text N characters long.
Last edited by Josh_Kablack on Thu Dec 04, 2008 5:30 am, edited 1 time in total.
"But transportation issues are social-justice issues. The toll of bad transit policies and worse infrastructure—trains and buses that don’t run well and badly serve low-income neighborhoods, vehicular traffic that pollutes the environment and endangers the lives of cyclists and pedestrians—is borne disproportionately by black and brown communities."
ckafrica
Duke
Posts: 1139
Joined: Fri Mar 07, 2008 7:54 pm
Location: HCMC, Vietnam

Post by ckafrica »

JK: Yeah that did it thanks. Linux would have just made my brain hurt. I'm not the most IT savvy there is
The internet gave a voice to the world thus gave definitive proof that the world is mostly full of idiots.
User avatar
Crissa
King
Posts: 6720
Joined: Fri Mar 07, 2008 7:54 pm
Location: Santa Cruz

Post by Crissa »

In some search strings you can just paste a line with return in it. That's usually what I try.

-Crissa
Post Reply